322 research outputs found

Truth Discovery in Crowdsourced Detection of Spatial Events

Author: Bishop C. M.
Dawid A. P.
Pasternack J.
Qi G.-J.
Raykar V. C.
Wang D.
Welinder P.
Whitehill J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 03/11/2014

Postprint

Aberdeen University Research

Crossref

Southampton (e-Prints Soton)

University of St. Andrews - Pure

Globally Optimal Crowdsourcing Quality Management

Author: Buckley C.
Carpenter B.
Chen X.
Karger D. R.
Liu Q.
Raykar V. C.
Sheshadri A.
Welinder P.
Whitehill J.
Zhou D.
Zhou D.
Publication date: 01/03/2015

We study crowdsourcing quality management, that is, given worker responses to a set of tasks, our goal is to jointly estimate the true answers for the tasks, as well as the quality of the workers. Prior work on this problem relies primarily on applying Expectation-Maximization (EM) on the underlying maximum likelihood problem to estimate true answers as well as worker quality. Unfortunately, EM only provides a locally optimal solution rather than a globally optimal one. Other solutions to the problem (that do not leverage EM) fail to provide global optimality guarantees as well. In this paper, we focus on filtering, where tasks require the evaluation of a yes/no predicate, and rating, where tasks elicit integer scores from a finite domain. We design algorithms for finding the global optimal estimates of correct task answers and worker quality for the underlying maximum likelihood problem, and characterize the complexity of these algorithms. Our algorithms conceptually consider all mappings from tasks to true answers (typically a very large number), leveraging two key ideas to reduce, by several orders of magnitude, the number of mappings under consideration, while preserving optimality. We also demonstrate that these algorithms often find more accurate estimates than EM-based algorithms. This paper makes an important contribution towards understanding the inherent complexity of globally optimal crowdsourcing quality management
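The idea of considering every mapping from tasks to true answers can be illustrated with a naive enumeration for yes/no (filtering) tasks. This is only a sketch under stated assumptions, not the paper's algorithm: it omits the pruning ideas the abstract mentions, it assumes a single accuracy parameter per worker constrained to be at least 0.5, and the names `log_likelihood` and `global_mle` are hypothetical.

```python
from itertools import product
from math import log


def log_likelihood(truth, responses):
    """Best achievable log-likelihood of the responses for a fixed truth.

    responses[w][t] is worker w's 0/1 answer to task t, or None if the
    worker skipped the task. Assumption: each worker has one accuracy
    p_w >= 0.5, so its maximum-likelihood value for a fixed truth is the
    worker's observed agreement rate, clipped below at 0.5.
    """
    total = 0.0
    for answers in responses:
        graded = [a == truth[t] for t, a in enumerate(answers) if a is not None]
        n, k = len(graded), sum(graded)
        if n == 0:
            continue
        p = max(k / n, 0.5)
        if k:
            total += k * log(p)
        if n - k:
            total += (n - k) * log(1 - p)
    return total


def global_mle(responses, num_tasks):
    """Enumerate all 2**num_tasks truth assignments and keep the best one."""
    return max(
        product((0, 1), repeat=num_tasks),
        key=lambda truth: log_likelihood(truth, responses),
    )


if __name__ == "__main__":
    # Three workers, three tasks; the third worker disagrees on task 0.
    votes = [[1, 1, 0], [1, 1, 0], [0, 1, 0]]
    print(global_mle(votes, 3))
```

The exhaustive search is exponential in the number of tasks, which is why reducing the set of candidate mappings matters in practice.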

arXiv.org e-Print Archive

CiteSeerX

Crossref

eScholarship - University of California

Determining the Best Protocol for Raising Larvae of the Sea Urchin Eucidaris Tribuloides

Author: Balser Faculty Advisor, Elizabeth J.
Butler Kimberlee M.
Whitehill Elizabeth A. G.
Publication venue: Digital Commons @ IWU
Publication date: 17/04/2004

Digital Commons @ Illinois Wesleyan University

Exploration of the Genetic Epidemiology of Asthma: A Review, With a Focus on Prevalence in Children and Adolescents in the Caribbean

Author: Kumar A.
Mohan A.
Mohan A.
Roberto A. J.
Whitehill B. C.
Publication venue: ODU Digital Commons
Publication date: 01/01/2014

Asthma is a chronic disease caused by the inflammation of the main air passages of the lungs. This paper outlines a review of the published literature on asthma. While a few studies show a trend of rising asthma cases in the Caribbean region, even fewer have explored the genetic epidemiological factors of asthma. This is a literature review that seeks to sum the body of knowledge on the epidemiology of asthma. Specifically, the major objective of the literature review is to provide a unified information base on the current state of factors involved in the genetic epidemiology of asthma. The review is a simple, yet detailed summary of the literature sources and their methodology and findings on the genetic epidemiology of asthma. Further, it seeks to direct this effort to the Caribbean region. The paper then reviews a summarized and synthesized collection of the body of previous research. Of specific interest are peer-reviewed sources that have been published in recent times. The paper provides more recent insight and recapitulates on the previous research, while tracing the intellectual progress on the debate. Where possible, reviewing and discussing the results of the previous literature, this review singles out the gaps and potential future research directions for studying the genetic epidemiology of asthma. Overall, we hope to contribute to a more synthesized knowledge and improved understanding of the previous literature and future potential direction of genetic and epidemiological asthma research

Old Dominion University

Incidence of Pediatric Cannabis Exposure Among Children and Teenagers Aged 0 to 19 Years Before and After Medical Marijuana Legalization in Massachusetts

Author: Bhutta Waqaas A.
Burns Michael D.
Chary Michael
Harrington Calla
Lang Cheryl J.
Whitehill Jennifer M.
Publication venue: ScholarWorks@UMass Amherst
Publication date: 01/01/2019

Importance: Pediatric health care contacts due to cannabis exposure increased in Colorado and Washington State after cannabis (marijuana) policies became more liberal, but evidence from other US states is limited. Objective: To document the incidence of pediatric cannabis exposure cases reported to the Regional Center for Poison Control and Prevention (RPC) before and after medical marijuana legalization (MML) in Massachusetts. Design, Setting, and Participants: Cross-sectional comparison of pediatric cannabis exposure cases 4 years before and after MML in Massachusetts. The exposure cases included those of 218 children and teenagers aged between 0 and 19 years, as reported to the RPC from 2009 to 2016. Census data were used to determine the incidence. Data analysis was performed from November 12, 2018, to July 20, 2019. Exposure: Cannabis products. Main Outcomes and Measures: Incidence of RPC-reported cannabis exposure cases, both single substance and polysubstance, for the age group of 0 to 19 years, and cannabis product type, coingestants, and clinical effects. Results: During the 8-year study period (2009-2016), the RPC received 218 calls involving cannabis exposure (98 single substance, 120 polysubstance) in children and teenagers aged 0 to 19 years, representing 0.15% of all RPC calls in that age group for that period. Of the total exposure cases, males accounted for 132 (60.6%) and females 86 (39.4%). The incidence of single-substance cannabis calls increased from 0.4 per 100 000 population before MML to 1.1 per 100 000 population after (incidence rate ratio, 2.4; 95% CI, 1.5-3.9), a 140% increase. The age group of 15 to 19 years had the highest frequency of RPC-reported cannabis exposures (178 calls [81.7%]). The proportion of all RPC calls due to single-substance cannabis exposure increased overall for all age groups from 29 before MML to 69 afterward. Exposure to edible products increased after MML for most age groups.
Conclusions and Relevance: Pediatric cannabis exposure cases increased in Massachusetts after medical marijuana was legalized in 2012, despite using childproof packaging and warning labels. This study provides additional evidence suggesting that MML may be associated with an increase in cannabis exposure cases among very young children, and extends prior work showing that teenagers are also experiencing increased cannabis-related health system contacts via the RPC. Additional efforts are needed to keep higher-potency edible products and concentrated extracts from children and teenagers, especially considering the MML and retail cannabis sales in an increasing number of US states.
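The arithmetic linking the reported counts, percentages, and incidence rate ratio can be checked with a short sketch. The helper names below are hypothetical, and only figures quoted in the abstract are used. Note that dividing the rounded rates (1.1 / 0.4) does not give exactly 2.4, presumably because the published rates are rounded to one decimal place.

```python
def share_percent(part, whole, digits=1):
    """Percentage of `whole` represented by `part`, rounded for reporting."""
    return round(part / whole * 100, digits)


def percent_increase_from_irr(irr):
    """Relative increase implied by an incidence rate ratio (after / before)."""
    return (irr - 1.0) * 100.0


if __name__ == "__main__":
    total_calls = 98 + 120  # single-substance plus polysubstance calls
    print(total_calls)                      # total exposure calls
    print(share_percent(132, total_calls))  # share of male cases
    print(share_percent(178, total_calls))  # share aged 15 to 19 years
    print(percent_increase_from_irr(2.4))   # increase implied by the IRR
```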

ScholarWorks@UMass Amherst

Limitations of Majority Agreement in Crowdsourced Image Interpretation

Author: Bachrach Y
Bohannon J
Fritz S
Howe J
R Core Team
Raddick M J
Salk C F
See L
Wang J
Welinder P
Whitehill J
Publication venue: 'Wiley'
Publication date: 01/03/2016

Crowdsourcing can efficiently complete tasks that are difficult to automate, but the quality of crowdsourced data is tricky to evaluate. Algorithms to grade volunteer work often assume that all tasks are similarly difficult, an assumption that is frequently false. We use a cropland identification game with over 2,600 participants and 165,000 unique tasks to investigate how best to evaluate the difficulty of crowdsourced tasks and to what extent this is possible based on volunteer responses alone. Inter-volunteer agreement exceeded 90% for about 80% of the images and was negatively correlated with volunteer-expressed uncertainty about image classification. A total of 343 relatively difficult images were independently classified as cropland, non-cropland or impossible by two experts. The experts disagreed weakly (one said impossible while the other rated as cropland or non-cropland) on 27% of the images, but disagreed strongly (cropland vs. non-cropland) on only 7%. Inter-volunteer disagreement increased significantly with inter-expert disagreement. While volunteers agreed with expert classifications for most images, over 20% would have been mis-categorized if only the volunteers’ majority vote was used. We end with a series of recommendations for managing the challenges posed by heterogeneous tasks in crowdsourcing campaigns
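Majority voting and inter-volunteer agreement, as discussed in the abstract, can be sketched in a few lines. This is an illustrative sketch, not the study's analysis code; the function name `majority_vote` and the example labels are assumptions.

```python
from collections import Counter


def majority_vote(labels):
    """Return (winning label, fraction of volunteers who chose it).

    Ties are resolved in favor of the label that appears first in `labels`,
    which is how Counter.most_common orders equal counts.
    """
    counts = Counter(labels)
    label, votes = counts.most_common(1)[0]
    return label, votes / len(labels)


if __name__ == "__main__":
    answers = ["cropland"] * 9 + ["non-cropland"]
    label, agreement = majority_vote(answers)
    print(label, agreement)
```

A high agreement fraction does not by itself guarantee a correct label, which is the limitation the abstract highlights for difficult images.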

Crossref

International Institute for Applied Systems Analysis (IIASA)

Gluon helicity from global analysis of experimental data and lattice QCD Ioffe time distributions

Author: Karpie J.
Melnitchouk W.
Monahan C.
Orginos K.
Qiu J. -W.
Richards D. G.
Sato N.
Whitehill R. M.
Zafeiropoulos S.
Publication date: 27/10/2023

We perform a new global analysis of spin-dependent parton distribution functions with the inclusion of Ioffe time pseudo-distributions computed in lattice QCD (LQCD), which are directly sensitive to the gluon helicity distribution, $\Delta g$. These lattice data have an analogous relationship to parton distributions as do experimental cross sections, and can be readily included in global analyses. We focus in particular on the constraining capability of current LQCD data on the sign of $\Delta g$ at intermediate parton momentum fractions $x$, which was recently brought into question by analysis of data in the absence of parton positivity constraints. We find that present LQCD data cannot discriminate between positive and negative $\Delta g$ solutions, although significant changes in the solutions for both the gluon and quark sectors are observed. Comment: 24 pages, 7 figures

arXiv.org e-Print Archive

Efficient crowdsourcing for multi-class labeling

Author: David R. Karger
Devavrat Shah
Jin R.
Karger D. R.
Liu Q.
Raykar V. C.
Sewoong Oh
Welinder P.
Whitehill J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2013

Crowdsourcing systems like Amazon's Mechanical Turk have emerged as an effective large-scale human-powered platform for performing tasks in domains such as image classification, data entry, recommendation, and proofreading. Since workers are low-paid (a few cents per task) and tasks performed are monotonous, the answers obtained are noisy and hence unreliable. To obtain reliable estimates, it is essential to utilize appropriate inference algorithms (e.g. Majority voting) coupled with structured redundancy through task assignment. Our goal is to obtain the best possible trade-off between reliability and redundancy. In this paper, we consider a general probabilistic model for noisy observations for crowd-sourcing systems and pose the problem of minimizing the total price (i.e. redundancy) that must be paid to achieve a target overall reliability. Concretely, we show that it is possible to obtain an answer to each task correctly with probability 1-ε as long as the redundancy per task is O((K/q) log (K/ε)), where each task can have any of the K distinct answers equally likely, and q is the crowd-quality parameter that is defined through a probabilistic model. Further, effectively this is the best possible redundancy-accuracy trade-off any system design can achieve. Such a single-parameter crisp characterization of the (order-)optimal trade-off between redundancy and reliability has various useful operational consequences. Further, we analyze the robustness of our approach in the presence of adversarial workers and provide a bound on their influence on the redundancy-accuracy trade-off. Unlike recent prior work [GKM11, KOS11, KOS11], our result applies to non-binary (i.e. K>2) tasks. In effect, we utilize algorithms for binary tasks (with inhomogeneous error model unlike that in [GKM11, KOS11, KOS11]) as key subroutine to obtain answers for K-ary tasks. Technically, the algorithm is based on low-rank approximation of weighted adjacency matrix for a random regular bipartite graph, weighted according to the answers provided by the workers. National Science Foundation (U.S.)
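To get a feel for how the redundancy bound O((K/q) log (K/ε)) scales, the expression can be evaluated directly. This is a rough sketch only: the abstract gives an order bound, so the constant `c` (set to 1 here) and the function name `redundancy_per_task` are assumptions, and the returned numbers indicate scaling rather than actual worker counts.

```python
from math import ceil, log


def redundancy_per_task(K, q, eps, c=1.0):
    """Evaluate c * (K / q) * ln(K / eps), rounded up to a whole number.

    K   : number of possible answers per task
    q   : crowd-quality parameter from the probabilistic model (0 < q <= 1)
    eps : target per-task error probability
    c   : unknown constant hidden by the O(.) notation (assumed to be 1)
    """
    if not (K >= 2 and 0 < q <= 1 and 0 < eps < 1):
        raise ValueError("expected K >= 2, 0 < q <= 1, 0 < eps < 1")
    return ceil(c * (K / q) * log(K / eps))


if __name__ == "__main__":
    for K in (2, 4, 8):
        print(K, redundancy_per_task(K, q=0.5, eps=0.01))
```

The expression grows roughly linearly in K and only logarithmically in 1/ε, so tightening the error target is cheap compared with adding answer classes or using a lower-quality crowd.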

DSpace@MIT

Crossref

Approximate Nearest Neighbor Search to Support Manual Image Annotation of Large Domain-specific Datasets

Author: Boom B.
Lowe D. G.
Raykar V. C.
Weiss Y.
Whitehill J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2013

Crossref

Edinburgh Research Explorer

Affective Man-Machine Interface: Unveiling human emotions through biosignals

Author: A. Choi
A. Daly
A. Haag
A.C. Rencher
A.J. Fridlund
A.M. Kring
B. Schölkopf
B.L. Frederickson
C. Liu
C. Liu
C.D. Katsis
C.L. Cooper
C.L. Lisetti
C.M. Bishop
C.M.A.V. Ravenswaaij-Arts
D. Grandjean
D. Ververidis
E. Aarts
E. Leon
E.A. Butler
E.L. Broek Van den
E.L. Broek van den
E.L. Broek van den
E.L. Broek van den
F. Lotte
F. Nasoz
G.F. Solomon
G.G. Berntson
G.H.E. Gendolla
G.N. Yannakakis
H. Gunes
H.D. Critchley
I.B. Mauss
J. Cacioppo
J. Kim
J. Scheirer
J. Whitehill
J. Zhai
J.A. Healey
J.A. Russel
J.A. Russell
J.H.D.M. Westerink
J.L.H. Schuler
J.T. Cacioppo
K.H. Kim
L.F. Barrett
M. Marwitz
M. Minsky
M. Tulder van
M.B.I. Reaz
P. Carrera
P. Grossman
P. Lukowicz
P. Rani
P. Rani
R. Ader
R. Sinha
R.W. Picard
R.W. Picard
S.D. Kreibig
S.H. Fairclough
S.K. Yoo
T.M. Cover
T.M. Mitchell
Task Force
W. Boucsein
W. James
Z. Zeng
Publication venue: Springer Verlag
Publication date: 01/01/2010

Humans have long been known to exhibit an electrical profile. This profile is altered by various psychological and physiological processes, which can be measured through biosignals such as electromyography (EMG) and electrodermal activity (EDA). These biosignals can reveal our emotions and, as such, can serve as an advanced man-machine interface (MMI) for empathic consumer products. However, such an MMI requires the correct classification of biosignals into emotion classes. This chapter starts with an introduction to biosignals for emotion detection. Next, a state-of-the-art review of automatic emotion classification is presented, followed by guidelines for affective MMI. Subsequently, a study is presented that explores the use of EDA and three facial EMG signals to determine neutral, positive, negative, and mixed emotions, using recordings of 21 people. A range of techniques was tested, resulting in a generic framework for automated emotion classification with up to 61.31% correct classification of the four emotion classes, without the need for personal profiles. Among various other directives for future research, the results emphasize the need for parallel processing of multiple biosignals.

Crossref

Repository TU/e

Pure OAI Repository

University of Twente Research Information